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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Ryazanov, Alexey G. 

Hait, William N. 
Pavur, Karen S. 



(ii) TITLE OF INVENTION: ELONGATION FACTOR- 2 KINASE (EF-2 KINASE) 
AND METHODS OF USE THEREFOR 

(iii) NUMBER OF SEQUENCES: 2 5 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: David A. Jackson, Esq. 

(B) STREET: 411 Hackensack Ave, Continental Plaza, 4th 

Floor 

(C) CITY: Hackensack 

(D) STATE: New Jersey 

(E) COUNTRY: USA 

(F) ZIP: 07601 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Fioppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

( C ) C LAS S I F I CAT I ON : 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Jackson Esq., David A. 

(B) REGISTRATION NUMBER: 26,742 



(C) REFERENCE /DOCKET NUMBER: 601-1-078 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 201-487-5800 

(B) TELEFAX: 201-343-1684 

(2) INFORMATION .FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2178 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATGGCAGACG AAGACCTCAT CTTCCGCCTG GAAGGTGTTG ATGGCGGCCA GTCCCCCCGA 

GCTGGCCATG ATGGTGATTC TGATGGGGAC AGCGACGATG AGGAAGGTTA CTTCATCTGC 

CCCATCACGG ATGACCCAAG CTCGAACCAG AATGTCAATT CCAAGGTTAA TAAGTACTAC 

AGCAACCTAA CAAAAAGTGA GCGGTATAGC TCCAGCGGGT CCCCGGCAAA CTCCTTCCAC 

TTCAAGGAAG CCTGGAAGCA CGCAATCCAG AAGGCCAAGC ACATGCCCGA CCCCTGGGCT 

GAGTTCCACC TGGAAGATAT TGCCACCGAA CGTGCTACTC GACACAGGTA CAACGCCGTC 



10 
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ACCGGGGAAT GGCTGGATGA TGAAGTTCTG ATCAAGATGG CATCTCAGCC CTTCGGCCGA 42 0 

GGAGCAATGA GGGAGTGCTT CCGGACGAAG AAGCTCTCCA ACTTCTTGCA TGCCCAGCAG 4 80 

5 TGGAAGGGCG CCTCCAACTA CGTGGCGAAG CGCTACATCG AGCCCGTAGA CCGGGATGTG 54 0 

TACTTTGAGG ACGTGCGTCT ACAGATGGAG GCCAAGCTCT GGGGGGAGGA GTATAATCGG 6 00 

CACAAGCCCC CCAAGCAGGT GGACATCATG C AGATGTG C A TCATCGAGCT GAAGGACAGA 660 

CCGGGCAAGC CCCTCTTCCA CCTGGAGCAC TACATCGAGG GCAAGTACAT CAAGTACAAC 72 0 

TCCAACTCTG GCTTTGTCCG TGATGACAAC ATCCGACTGA CGCCGCAGGC CTTCAGCCAC 780 

15 TTCACTTTTG AGCGTTCCGG CCATCAGCTG AT AGTGG TGG ACATCCAGGG AGTTGGGGAT 840 

U'j CTCTACACTG ACCCACAGAT C C AC AC GG AG ACGGGCACTG ACTTTGGAGA CGGCAACCTA 90 0 

GGTGTCCGCG GGATGGCGCT CTTCTTCTAC TCTCATGCCT GCAACCGGAT TTGCGAGAGC 960 

ATGGGCCTTG CTCCCTTTGA CCTCTCGCCC CGGGAGAGGG ATGCAGTGAA TCAGAACACC 102 0 

AAGCTGCTGC AATCAGCCAA GACCATCTTG AGAGGAACAG AGGAAAAATG TGGGAGCCCC 1080 

25 CGAGTAAGGA CCCTCTCTGG GAGCCGGCCA CCCCTGCTCC GTCCCCTTTC AGAGAACTCT 114 0 

GGAGACGAGA ACATGAGCGA CGTGACCTTC GACTCTCTCC CTTCTTCCCC ATCTTCGGCC 12 00 

ACACCACACA GCCAGAAGCT AGACCACCTC CATTGGCCAG TGTTCAGTGA CCTCGATAAC 12 60 

30 

ATGGCATCCA GAGACCATGA TCATCTAGAC AACCACCGGG AGTCTGAGAA TAGTGGGGAC 13 2 0 

AGCGGATACC CCAGTGAGAA G CGGGGTGAG CTGGATGACC CTGAGCCCCG AGAACATGGC 13 8 0 

35 CACTCATACA GTAATCGGAA GTACGAGTCT GACGAAGACA GCCTGGGCAG CTCTGGACGG 14 4 0 

GTATGTGTAG AGAAGTGGAA TCTCCTCAAC TCCTCCCGCC TCCACCTGCC GAGGGCTTCG 150 0 



nj 2 o 
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GCCGTGGCCC TGGAAGTGCA AAGGCTTAAT GCTCTGGACC TCGAAAAGAA AATCGGGAAG 1560 

TCCATTTTGG GGAAGGTCCA TCTGGCCATG GTGCGCTACC ACGAGGGTGG GCGCTTCTGC 162 0 

5 GAGAAGGGCG AGGAGTGGGA CCAGGAGTCG GCTGTCTTCC ACCTGGAGCA CGCAGCCAAC 168 0 

CTGGGCGAGC TGGAGGCCAT CGTGGGCCTG GGACTCATGT ACTCGCAGTT GCCTCATCAC 174 0 

ATCCTAGCCG ATGTCTCTCT GAAGGAGACA GAAGAGAACA AAACCAAAGG ATTTGATTAC 1800 

10 

TTACTAAAGG CCGCTGAAGC TGGCGACAGG CAGTCCATGA TCCTAGTGGC GCGAGCTTTT 1860 

" I 

^ GACTCTGGCC AGAACCTCAG CCCGGACAGG TGCCAAGACT GGCTAGAGGC CCTGCACTGG 192 0 

15 TACAACACTG CCCTGGAGAT GACGGACTGT GATGAGGGCG GTGAGTACGA CGGAATGCAG 198 0 

m 

U1 GACGAGCCCC GGTACATGAT GCTGGCCAGG GAGGCAGAGA TGCTGTTCAC AGGAGGCTAC 2 04 0 



nj 20 



GGGCTGGAGA AGGACCCGCA GAGATCAGGG GACTTGTATA CCCAGGCAGC AGAGGCAGCG 2100 

ATGGAAGCCA TGAAGGGCCG ACTGGCCAAC C AGT ACT AC C AAAAGGCTGA AGAGGCCTGG 216 0 

GCCCAGATGG AGGAATAA 217 8 
25 (2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 5 amino acids 

(B) TYPE: amino acid 

30 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

35 (iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ala Asp Glu Asp Leu lie Phe Arg Leu Glu Gly Val Asp Gly Gly 
1-5 10 15 

Gin Ser Pro Arg Ala Gly His Asp Gly Asp Ser Asp Gly Asp Ser Asp 
20 25 30 

Asp Glu Glu Gly Tyr Phe lie Cys Pro lie Thr Asp Asp Pro Ser Ser 
35 40 45 

Asn Gin Asn Val Asn Ser Lys Val Asn Lys Tyr Tyr Ser Asn Leu Thr 
50 55 60 

Lys Ser Glu Arg Tyr Ser Ser Ser Gly Ser Pro Ala Asn Ser Phe His 
65 70 75 80 

Phe Lys Glu Ala Trp Lys His Ala lie Gin Lys Ala Lys His Met Pro 
85 90 95 

Asp Pro Trp Ala Glu Phe-His Leu Glu Asp lie Ala Thr Glu Arg Ala 
100 105 110 

Thr Arg His Arg Tyr Asn Ala Val Thr Gly Glu Trp Leu Asp Asp Glu 
115 120 125 

Val Leu lie Lys Met Ala Ser Gin Pro Phe Gly Arg Gly Ala Met Arg 
130 135 140 

Glu Cys Phe Arg Thr Lys Lys Leu Ser Asn Phe Leu His Ala Gin Gin 
145 150 155 160 



Trp Lys Gly Ala Ser Asn Tyr Val Ala Lys Arg Tyr lie Glu Pro Val 
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165 170 175 

Asp Arg Asp Val Tyr Phe Glu Asp Val Arg Leu Gin Met Glu Ala Lys 
180 185 190 

Leu Trp Gly Glu Glu Tyr Asn Arg His Lys Pro Pro Lys Gin Val Asp 
195 200 205 

lie Met Gin Met Cys lie lie Glu Leu Lys Asp Arg Pro Gly Lys Pro 
210 215 220 

Leu Phe His Leu Glu His Tyr lie Glu Gly Lys Tyr lie Lys Tyr Asn 
225 230 235 240 

Ser Asn Ser Gly Phe Val Arg Asp Asp Asn lie Arg Leu Thr Pro Gin 
245 250 255 

Ala Phe Ser His Phe Thr Phe Glu Arg Ser Gly His Gin Leu lie Val 
260 265 270 

Val Asp lie Gin Gly Val Gly Asp Leu Tyr Thr Asp Pro Gin lie His 
275 . 280 285 

Thr Glu Thr Gly Thr Asp Phe Gly Asp Gly Asn Leu Gly Val Arg Gly 
290 .295 300 

Met Ala Leu Phe Phe Tyr Ser His Ala Cys Asn Arg lie Cys Glu Ser 
305 310 315 320 

Met Gly Leu Ala Pro Phe Asp Leu Ser Pro Arg Glu Arg Asp Ala Val 
325 330 335 

Asn Gin Asn Thr Lys Leu Leu Gin Ser Ala Lys Thr lie Leu Arg Gly 
340 345 350 



Thr Glu Glu Lys Cys Gly Ser Pro Arg Val Arg Thr Leu Ser Gly Ser 
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Arg Pro Pro Leu Leu Arg Pro Leu Ser Glu Asn Ser Gly Asp Glu Asn 
370 375 380 

Met Ser Asp Val Thr Phe Asp Ser Leu Pro Ser Ser Pro Ser Ser Ala 
385 390 395 400 

Thr Pro His Ser Gin Lys Leu Asp His Leu His Trp Pro Val Phe Ser 
405 410 415 

Asp Leu Asp Asn Met Ala Ser Arg Asp His Asp His Leu Asp Asn His 
420 425 430 

Arg Glu Ser Glu Asn Ser Gly Asp Ser Gly Tyr Pro Ser Glu Lys Arg 
435 440 445 

Gly Glu Leu Asp Asp Pro Glu Pro Arg Glu His Gly His Ser Tyr Ser 
450 455 460 

Asn Arg Lys Tyr Glu Ser Asp Glu Asp Ser Leu Gly Ser Ser Gly Arg 
465 470 475 480 

Val Cys Val Glu Lys Trp Asn Leu Leu Asn Ser Ser Arg Leu His' Leu 
485 490 495 

Pro Arg Ala Ser Ala Val -Ala Leu Glu Val Gin Arg Leu Asn Ala Leu 
500 505 510 

Asp Leu Glu Lys Lys lie Gly Lys Ser lie Leu Gly Lys Val His Leu 
515 520 525 

Ala Met Val Arg Tyr His Glu Gly Gly Arg Phe Cys Glu Lys Gly Glu 
530 535 540 

Glu Trp Asp Gin Glu Ser Ala Val Phe His Leu Glu His Ala Ala Asn 
545 550 555 560 



Leu Gly Glu Leu Glu Ala lie Val Gly Leu Gly Leu Met Tyr Ser Gin 
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565 570 575 

Leu Pro His His He Leu Ala Asp Val Ser Leu Lys Glu Thr Glu-Glu 
580 585 590 

Asn Lys Thr Lys Gly Phe Asp Tyr Leu Leu Lys Ala Ala Glu Ala Gly 
595 600 

Asp Arg Gin Ser Met He Leu Val Ala Arg Ala Phe Asp Ser Gly Gin 
610 615 620 

Asn Leu Ser Pro Asp Arg Cys Gin Asp Trp Leu Glu Ala Leu His Trp 
625 630 635 640 

Tyr Asn Thr Ala Leu Glu Met Thr Asp Cys Asp Glu Gly Gly Glu Tyr 
645 650 655 

Asp Gly Met Gin Asp Glu Pro Arg Tyr Met Met Leu Ala Arg Glu Ala 
660 665 670 

Glu Met Leu Phe Thr Gly Gly Tyr Gly Leu Glu Lys Asp Pro Gin Arg 
675 680 685 

Ser Gly Asp Leu Tyr Thr Gin Ala Ala Glu Ala Ala Met Glu Ala Met 
690 . -695 700 

Lys Gly Arg Leu Ala Asn Gin Tyr Tyr Gin Lys Ala Glu Glu Ala Trp 
705 710 715 720 

Ala Gin Met Glu Glu 
725 

INFORMATION FOR SEQ ID NO : 3 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2175 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
ATGGCAGACG AAGACCTCAT CTTCTGCCTG GAAGGTGTTG ACGGTGGCAG GTGCTCCCGA 
GCTGGCCACA ATGCGGACTC TGACACAGAC AGTGACGATG ATGAGGGCTA TTTCATCTGC 
CCCATCACTG ATG AC C AC AT GTCCAATCAG AATGTCAGCT CCAAAGTCCA GAGCTACTAT 
AGCAACCTAA CAAAAACAGA GTGCGGCTCC ACAGGGTCAC CAGCCAGCTC CTTCCACTTC 
AAGGAAGCCT GGAAGCATGC GATCGAGAAA GCCAAGCACA TGCCTGACCC CTGGGCTGAA 
TTCCATCTCG AGGACATCGC CACAGAACAT GCTACTCGGC ACAGGTACAA CGCTGTCACC 
GGGGAATGGC TGAAAGACGA GGTTCTGATC AAGATGGCGT CTCAGCCCTT CGGCCGTGGA 
GCAATGAGGG AGTGCTTCAG GACGAAGAAA CTCTCCAACT TCTTGCACGC CCAGCAATGG 
AAGGGGGCCT CCAACTACGT GGCCAAGCGC TACATCGAGC CGGTGGACAG GAGCGTGTAC 
TTTGAGGATG TGCAGCTCCA GATGGAGGCG AAGCTCTGGG GGGAGGATTA CAATCGGCAC 
AAGCCCCCCA AGCAGGTGGA TATCATGCAG ATGTGCATCA TTGAGCTAAA GGACAGACCA 
GGCCAGCCCC TCTTCCACTT GGAGCACTAC ATTGAGGGCA AGTACATCAA GTACAATTCC 



65 

AACTCAGGCT TTGTCCGTGA TGACAACATC CGACTAACCC CACAGGCCTT CAGCCATTTC 
ACATTTGAGC GTTCTGGTCA TCAGCTGATT GTAGTGGACA TCCAGGGTGT GGGTGACCTT 
5 TATACCGACC CACAGATCCA CACTGAGAAA GGCACTGACT TTGGAGATGG TAACCTTGGT 
GTCCGGGGAA TGGCTCTCTT CTTCTACTCT CATGCCTGCA ACCGGATTTG TCAGAGCATG 
GGCCTTACGC CCTTTGACCT CTCCCCACGG GAACAGGATG CGGTGAATCA GAGCACCAGG 

10 

CTATTGCAAT CAGCCAAGAC CATCTTGAGG GGGACAGAGG AGAAGTGTGG GAGTCCCCGC 



30 

AAGGATGAGG AGTGGGATCG AGAGTCAGCC AT CTTC CATC TGGAGCATGC AGCTGAC CTG 



780 



840 



900 



960 



1020 



1080 



1140 



ATAAGGACAC TCTCTAGCAG CCGGCCCCCT TTGCTCCTTC GCCTGTCAGA GAACTCCGGG 
15 GATGAGAACA TGAGTGACGT GACCTTTGAC TCTCTGCCTT CCTCCCCGTC TTCAGCTACA 1200 
CCACACAGCC AGAAACTGGA CCACCTCCAT TGGCCAGTGT TTGGTGACCT CGATAACATG 1260 
GGCCCTAGAG ACCATGACCG TATGGACAAT CACCGGGACT CTGAGAATAG TGGGGACAGT 

20 

GGGTATCCAA GCGAGAAGCG AAGTGACCTG GATGATCCTG AGCCCCGAGA ACACGGCCAC 



1320 



1380 



1440 



1500 



TCCAACGGCA ACCGAAGGCA TGAATCTGAC GAGGATAGCC TGGGCAGCTC TGGACGGGTC 
25 TGTGTGGAGA CGTGGAACCT GCTCAAXCCC TCCCGCCTGC ACCTGCCGAG GCCCTCGGCC 

GTGGCCCTAG AAGTGCAGAG GCTAAATGCC CTGGACCTTG GAAGGAAAAT CGGGAAGTCT 156 0 
GTTTTGGGGA AAGTCCATTT GGCCATGGTG CGATACCACG AGGGCGGGCG CTTCTGCGAG 



1620 



1680 



GGAGAACTGG AGGCCATCGT GGGCCTAGGC CTCATGTACT CTCAGCTGCC CCACCACATC 1740 
35 CTGGCTGATG TCTCTCTGAA GGAGACAGAG GAGAACAAGA CAAAAGGCTT TGATTACTTA 1800 
CTGAAGGCGG CAGAAGCTGG TGACAGGCAT TCCATGATTT TAGTGGCCCG AGCTTTTGAC 1860 



66 

ACTGGCCTGA ACCTCAGCCC AGACAGGTGT CAAGACTGGT CGGAAGCCTT GCACTGGTAC 192 0 

AACACAGCCC TGGAGACAAC AGACTGCGAT GAAGGCGGGG AGTACGATGG GATACAGGAC 198 0 



5 GAGCCCCAGT ACGCACTGCT GGCCAGGGAG GCGGAGATGC TGCTCACCGG GGGATTTGGA 



2040 



CTGGACAAGA ACCCCCAAAG ATCAGGAGAT TTGTACACCC AGGCAGCTGA GGCAGCAATG 2100 



GAAGCCATGA AGGGCCGGCT AGCCAACCAG TACTACGAGA AGGCGGAAGA GGCCTGGGCC 216 0 



10 



CAGATGGAGG AATAA 



2175 



-it . 



U1 

s 

ru 



15 



20 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 724 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM : Mus musculus 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Met Ala Asp Glu Asp Leu lie Phe Cys Leu Glu Gly Val Asp Gly Gly 
15 10 15 



35 



Arg Cys Ser Arg Ala Gly His Asn Ala Asp Ser Asp Thr Asp Ser Asp 
20 25 30 
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Asp Asp Glu Gly Tyr Phe lie Cys Pro He Thr Asp Asp His Met Ser 
35 40 



45 



Asn Gin Asn Val Ser Ser Lys Val Gin Ser Tyr Tyr Ser Asn 
50 55 60 



Leu Thr 



Lys Thr Glu Cys Gly Ser Thr Gly Ser Pro Ala Ser Ser Phe Hi 



65 70 



s Phe 

? 5 80 



Lys Glu Ala Trp Lys His Ala He Glu Lys Ala Lys His Met Pro 



Asp 



85 9° 95 



Pro Trp Ala Glu Phe His Leu Glu 



100 105 



Asp He Ala Thr Glu His Ala Thr 

110 



Arg His Arg Tyr Asn Ala Val Thr Gly Glu Trp Leu Lys Asp Glu Val 

125 



H5 120 



Leu lie Lys Met Ala Ser Gin Pro Phe Gly Arg Gly Ala Met Arg Glu 
130 i3s 



140 



Cys Phe Arg Thr Lys Lys Leu Ser Asn Phe Leu His Ala Gin Gin Trp 

155 iso 



145 150 



Lys Gly Ala Ser Asn Tyr. Val Ala Lys Arg Tyr He Glu Pro Val Asp 



165 170 



175 



Arg Ser Val Tyr Phe Glu Asp Val Gin Leu Gin Met Glu Ala Lys Leu 
180 185 190 

Trp Gly Glu Asp Tyr Asn Arg His Lys Pro Pro Lys Gin Val Asp He 

195 200 2 05 

Met Gin Met Cys He He Glu Leu Lys Asp Arg Pro Gly Gin Pro Leu 
210 215 220 

Phe His Leu Glu His Tyr He Glu Gly Lys Tyr He Lys Tyr Asn Ser 
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225 230 235 



240 



Asn Ser Gly Phe Val Arg Asp Asp Asn He Arg Leu Thr Pro Gin Ala 
245 250 255 

Phe Ser His Phe Thr Phe Glu Arg Ser Gly His Gin Leu He Val Val 
250 265 270 

Asp He Gin Gly Val Gly Asp Leu Tyr Thr Asp Pro Gin He His Thr 
275 280 285 

Glu Lys Gly Thr Asp Phe Gly Asp Gly Asn Leu Gly Val Arg Gly Met 
290 295 300 

Ala Leu Phe Phe Tyr Ser His Ala Cys Asn Arg He Cys Gin Ser Met 
305 310 315 320 

Gly Leu Thr Pro Phe Asp Leu Ser Pro Arg Glu Gin Asp Ala Val Asn 
325 330 335 

Gin Ser Thr Arg Leu Leu Gin Ser Ala Lys Thr He Leu Arg Gly Thr 
340 345 350 

Glu Glu Lys Cys Gly Ser Pro Arg He Arg Thr Leu Ser Ser Ser Arg 
355 -. 360 365 

Pro Pro Leu Leu Leu Arg Leu Ser Glu Asn Ser Gly Asp Glu Asn Met 
370 375 380 

Ser Asp Val Thr Phe Asp Ser Leu Pro Ser Ser Pro Ser Ser Ala Thr 
385 3 *0 395 400 

Pro His Ser Gin Lys Leu Asp His Leu His Trp Pro Val Phe Gly Asp 
405 410 415 

Leu Asp Asn Met Gly Pro Arg Asp His Asp Arg Met Asp Asn His Arg 
420 425 4 3o 
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Asp Ser Glu Asn Ser Gly Asp Ser Gly Tyr Pro Ser Glu Lys Arg Ser 
435 440 445 

Asp Leu Asp Asp Pro Glu Pro Arg Glu His Gly His Ser Asn Gly Asn 
450 455 460 



Arg Arg His Glu Ser Asp Glu Asp Ser Leu Gly Ser Ser Gly Arg Val 

480 



465 . 47 0 475 



Cys Val Glu Thr Trp Asn Leu Leu Asn Pro Ser Arg Leu His Leu Pro 
485 490 495 

Arg Pro Ser Ala Val Ala Leu Glu Val Gin Arg Leu Asn Ala Leu Asp 
500 505 sio 

Leu Gly Arg Lys lie Gly Lys Ser Val Leu Gly Lys Val His Leu Ala 
515 520 525 

Met Val Arg Tyr His Glu Gly Gly Arg Phe Cys Glu Lys Asp Glu Glu 
530 535 540 

Trp Asp Arg Glu Ser Ala He Phe His Leu Glu His Ala Ala Asp Leu 
545 550 555 560 

Gly Glu Leu Glu Ala He, Val Gly Leu Gly Leu Met Tyr Ser Gin Leu 
565 570 575 

Pro His His He Leu Ala Asp Val Ser Leu Lys Glu Thr Glu Glu Asn 
580 585 59Q 

Lys Thr Lys Gly Phe Asp Tyr Leu Leu Lys Ala Ala Glu Ala Gly Asp 
595 * 600 605 

Arg His Ser Met He Leu Val Ala Arg Ala Phe Asp Thr Gly Leu. Asn 
610 615 620 



Ser Pro Asp Arg Cys Gin Asp Trp Ser Glu Ala Leu His Trp Tyr 
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625 

Asn Thr Ala Leu 

Gly lie Gin Asp 
660 

Met Leu Leu Thr 
675 

Gly Asp Leu Tyr 
690 

Gly Arg Leu Ala 
705 

Gin Met Glu Glu 



630 

Glu Thr Thr Asp 
645 

Glu Pro Gin Tyr 

Gly Gly Phe Gly 
680 

Thr Gin Ala Ala 
695 

Asn Gin Tyr Tyr 
710 



635 

Cys Asp Glu Gly 
650 

Ala Leu Leu Ala 
665 

Leu Asp Lys Asn 

Glu Ala Ala Met 
700 

Glu Lys Ala Glu 
715 



640 

Gly Glu Tyr Asp 
655 

Arg Glu Ala Glu 
670 

Pro Gin Arg Ser 
685 

Glu Ala Met Lys 

Glu Ala Trp Ala 
720 



INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3465 base pairs 

(B) TYPE: nucleic ^cid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dictyos telium discoideum 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
ATGTTTAATA TAAAAAAGAG AAAAGAGAGT ATAACAGGTA TACCACCAAT AAATGTTAAT 
5 AGTCCACAAT CAGTTCCATT GAGTGGAACA TTGCAATCAC CATTGATTAC ACCAAATTCA 
CCAAATTTTG TTTCACGTCA ATGTCCATTC AAAAAGTTTG GATGTAGTAG TTTTTTAGTT 
TCAAAGGCAG AGTTTGATAA TCACTTAAAG GATGACGCAC AATTTCATTT ACAATTGGCA 

1G 

GTGGAGAAAT TTGATCATCA ATTTGATTTA CACACACAAT TGATGGCACA TTTTACTGAG 
CAAATGGAGG ATCAATTAGA GAAAACAATG AAGGTCGTAC GTAATCATAC AGATAGTTTA 
15 GGCGGTAATG TTCAAACCAA ATTGGATGAA GGCATTGAAA AATGT ATGG C TTTTGCTAAA 
AAGGTTGAAC AACAACAACA ACAATTGGCC AAAAGATTAA TC ACT CAACA AATTCAAGAG 
AAGAAATCAA CCTCTTCACC TTTAGTTAAA GGTGGTATTA GTGGTGGTGG TGGTAGTGGT 
GGCGATGATT CTTTTGATGG CGCAAATATA TCATCAATGT CAACTAGTAA ACAAGAATTA 
CAACAAGAAT TACAATCATT ATCAATTAAA ATGAAAAAAG AATTGACAGA ATTATCCGAT 
GAACTATCAC AAAAATTAGA ACGTTCAACA GGTAATATAG ATATTAAAAT AAAGAGAATC 
GAAGGTGAAG TTAATGAAAA GATTGATAAA CGTCAATTGG TCTCTACGAT CGATGATTCA 
ATTGGAAAGA AAACAGATTC CATCGGTTAT ACATTGGAGA GTTCAATCAT TAAAAAGGTT 
GAAGAGAAAG AGAAAAAGAA ATCCGAACAA AATCAACTTC TCTTTGATTC AAAGATTGAA 
TCCTTAAAAG ATAAGATTAA AATCATTGAA ACTCAACAAT TGGATACTTC ATCAGAGGTT 
AGAAAATTGA AATTAGAAAG TACAAGTAGT GGAAATTTAA TGGCAGGTCT TAATGGTACC 
TCTGGTAGAC CTTCATCATC TTCTCACTTT ATTCCATCCT CTGTTTCTGC CGCTGCTAAC 
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AATATCAACA AGAATGAAAT CATGGAAGAG GTTAAAAAGG TAGAAGAGAA ACTTCAAAAG 114 0 

AAAATTCGTG AAGAGATTGA TAATACAAAA GCTGAACTCT CAAAGGTTGA ACGTTCCGTT 12 00 

5 AAAGATAATC GTAGTGAAAT TGAAGGTTTG GAAAAAGATT GTAAGAATCA ATTCGATAAA 12 6 0 

CAAGACAATA AGATCAAACA AGTTGAGGAT GATTTGAAAA AG AG TG ATTC ATTACTTTTG 132 0 

TTAATGCAAA ATAACCTCAA GAAATATAAT GAATTTGTTG ATAGAGAACG TG AT CGTGAA 13 80 

10 

AGTGAACGTT TGAAACTTCA AG ATTC TAT C AAACGTTTAG AACAAAATCA AAAGAAAATC 144 0 

GAAGCTGAAA TTCAAGAAGG TAATGAACAA GTTGAACGTG TTTTACGTGA GGAAGCTTCA 1500 

15 ATCTCACCAA TTAGTTCAGT TCCAAAATCA CCAATCACAA CCAAACGTTC ATCGATTATT 156 0 

TTAAATT C AC C AC C AATG AC TTCACAACAA TCATCACCAA AGATTCAAGA TCTTCTCTCA 162 0 

AGTAGTGGTA GTAGTAGTGT TAG TGGT AT A AATATTTCCT CTGAAACCGG TGAAATGGGT 168 0 

20 

ATTCTTTGGG AATTTGATCC AATCATTAAC AAATGGATTA GATTATCAAT GAAGCTAAAG 174 0 

GTAGAAAGAA AACCATTTGC AGAGGGTGCT C T T AG AG AGG CTTATCATAC CGTTTCATTG 18 0 0 

25 GGTGTTGGAA CCGATGAAAA TTATCCAJTA GGTACAACCA CCAAATTATT CCCACCAATT 18 6 0 

GAAATGATTT CACCAATTTC AAAGAATAAT GAGGCAATGA CTCAATTGAA GAATGGTACA 192 0 

AAATTTGTTT TGAAACTCTA CAAAAAGGAA GCTGAACAAC AAGCTAGCAG AGAATTATAC 198 0 

30 

TTTGAAGATG TTAAAATGCA AATGGTCTGT AG AG ATTGG G GTAATAAATT CAATCAAAAG 2 04 0 

AAACCACCAA AGAAAATTGA ATTCCTTATG TCTTGGGTTG TAGAGTTAAT CG AT AG AT C T 210 0 

35 CCTTCTTCCA ATGGTCAACC AATACTTTGT TCCATTGAAC CATTATTGGT TGGTGAATTC 216 0 

AAAAAGAATA ATTCAAATTA TGGTGCAGTT TTAACCAATC GTTCAACTCC ACAAGCATTC 2220 



73 

TCTCATTTCA CCTATGAACT CTCAAATAAA CAAATGATCG TTGTCGATAT TCAAGGTGTT 
GATGATCTTT ACACTGATCC TCAAATTCAT ACACCCGATG GTAAAGGATT TGGTCTTGGT 
AATCTTGGTA AAGCAGGTAT CAATAAATTC ATCACCACTC ACAAATGTAA TGCTGTTTGT 
GCTCTTTTAG ATTTAGATGT TAAATTGGGT GGTGTACTAT CTGGAAATAA TAAGAAACAA 
CTTCAACAAG GTACTATGGT TATGCCAGAT ATTCTCCCAG AACTTATGCC ATCTGATAAC 
AC C ATT AAAG TGGGTGCAAA ACAACTTCCA AAAGCTGAAT TCTCAAAGAA AGATCTCAAA 
TGTGTTAGCA CCATTCAAAG TTTCCGTGAA CGTGTTAACT CGATCGCATT CTTTGATAAT 
CAAAAGTTAT TATGCGCTGG TTATGGTGAT GGTACCTATA GAGTTTTCGA TGTCAATGAC 
AATTGGAAAT GTTTATACAC TGTCAATGGT CATAGAAAAT CAATTGAAAG TATCGCTTGT 
AATAGTAATT ACATTTTCAC TTCATCACCT GATAACACCA TCAAAGTTCA TATCATTCGT 
AGTGGTAACA CCAAATGTAT AGAGACATTG GTTGGTCACA CTGGTGAAGT TAATTGTGTC 
GTGGCCAATG AAAAAT AT CT TTTCAGTTGT AGTTATGATA AAACTATCAA GGTTTGGGAT 
TTGTCAACCT TTAAAGAAAT TAAATCAXTT GAGGGTGTTC ATACAAAGTA CATTAAAACA 
TTGGCTTTGA GTGGACGTTA TCTTTTTAGT GGTGGTAACG ATCAAATCAT TTACGTTTGG 
GATACTGAAA CACTTAGTAT GCTTTTCAAT ATGCAAGGTC ATGAAGATTG GGTACTCTCT 
CTTCATTGTA CCGCTAGTTA TCTTTTCTCA ACCTCAAAAG ATAATGTCAT CAAGATTTGG 
GATCTCTCAA ATTTCAGTTG TATCGATACT CTAAAAGGTC ATTGGAATTC TGTCTCAAGT 
TGTGTCGTAA AAGATCGTTA TCTATACAGT GGTTCTGAAG ATAATTCAAT CAAAGTTTGG 
GATCTCGATA CACTTGAATG TGTTTACACC ATTCCAAAAT CTCATTCTTT GGGTGTAAAA 




TGTTTAATGG TTTTCAATAA TCAAATCATT TCTGCTGCTT TCGATGGTTC AATTAAAGTT 
TGGGAATGGC AATCGAAATA ATCTTTGTAA ATTTTTGTTA AAAAA 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Die tyostel ium discoideum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Phe Asn lie Lys Lys Arg Lys Glu Ser lie Thr Gly lie Pro Pro 
1 5 10 15 

lie Asn Val Asn Ser Pro Gin Ser Val Pro Leu Ser Gly Thr Leu Gin 
20 25 30 

Ser Pro Leu lie Thr Pro Asn Ser Pro Asn Phe Val Ser Arg Gin Cys 
35 40 45 

Pro Phe Lys Lys Phe Gly Cys Ser Ser Phe Leu Val Ser Lys Ala Glu 
50 55 60 



Phe Asp Asn His Leu Lys Asp Asp Ala Gin Phe His Leu Gin Leu Ala 
65 70 75 80 
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Val Glu Lys Phe Asp His Gin Phe Asp Leu His Thr Gin Leu Met Ala 
85 go 95 

His Phe Thr Glu Gin Met Glu Asp Gin Leu Glu Lys Thr Met Lys Val 
100 105 110 

Val Arg Asn His Thr Asp Ser Leu Gly Gly Asn Val Gin Thr Lys Leu 
11S . 120 125 

Asp Glu Gly lie Glu Lys Cys Met Ala Phe Ala Lys Lys Val Glu Gin 
130 135 X40 

Gin Gin Gin Gin Leu Ala Lys Arg Leu He Thr Gin Gin He Gin Glu 
145 150 iss 160 

Lys Lys Ser Thr Ser Ser Pro Leu Val Lys Gly Gly He Ser Gly Gly 
16 5 170 175 

Gly Gly Ser Gly Gly Asp Asp Ser Phe Asp Gly Ala Asn He Ser Ser 
180 185 i 9 o 

Met Ser Thr Ser Lys Gin Glu Leu Gin Gin Glu Leu Gin Ser Leu Ser 
195 200 205 

He Lys Met Lys Lys Glu..Leu Thr Glu Leu Ser Asp Glu Leu Ser Gin 
210 215 220 

Lys Leu Glu Arg Ser Thr Gly Asn He Asp He Lys He Lys Arg He 
225 230 235 240 

Glu Gly Glu Val Asn Glu Lys He Asp Lys Arg Gin Leu Val Ser Thr 
245 250 255 

He Asp Asp Ser He Gly Lys Lys Thr Asp Ser He Gly Tyr Thr Leu 
260 265 270 

Glu Ser Ser He He Lys Lys Val Glu Glu Lys Glu Lys Lys Lys Ser 
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275 280 285 

Glu Gin Asn Gin Leu Leu Phe Asp Ser Lys lie Glu Ser Leu Lys Asp 
290 295 300 

Lys lie Lys lie lie Glu Thr Gin Gin Leu Asp Thr Ser Ser Glu Val 
305 310 315 320 

Arg Lys Leu Lys Leu Glu Ser Thr Ser Ser Gly Asn Leu Met Ala Gly 
325 330 335 

Leu Asn Gly Thr Ser Gly Arg Pro Ser Ser Ser Ser His Phe lie Pro 
340 345 350 

Ser Ser Val Ser Ala Ala Ala Asn Asn lie Asn Lys Asn Glu lie Met 
355 360 365 

Glu Glu Val Lys Lys Val Glu Glu Lys Leu Gin Lys Lys lie Arg Glu 
370 375 380 

Glu lie Asp Asn Thr Lys Ala Glu Leu Ser Lys Val Glu Arg Ser Val 
385 390 395 400 

Lys Asp Asn Arg Ser Glu lie Glu Gly Leu Glu Lys Asp Cys Lys Asn 
405 410 415 

Gin Phe Asp Lys Gin Asp Asn Lys lie Lys Gin Val Glu Asp Asp Leu 
420 425 430 

Lys Lys Ser Asp Ser Leu Leu Leu Leu Met Gin Asn Asn Leu Lys Lys 
435 440 445 

Tyr Asn Glu Phe Val Asp Arg Glu Arg Asp Arg Glu Ser Glu Arg Leu 
450 455 460 

Lys Leu Gin Asp Ser lie Lys Arg Leu Glu Gin Asn Gin Lys Lys lie 
465 470 475 480 
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Glu Ala Glu lie Gin Glu Gly Asn Glu Gin Val Glu Arg Val Leu Arg 
4Q 5 490 495 

Glu Glu Ala Ser He Ser Pro He Ser Ser Val Pro Lys Ser Pro He 



500 505 



510 



Thr Thr Lys Arg Ser Ser He He Leu Asn Ser Pro Pro Met Thr Ser 
515 520 525 

Gin Gin Ser Ser Pro Lys He Gin Asp Leu Leu Ser Ser Ser Gly Ser 
530 535 540 

Ser Ser Val Ser Gly He Asn He Ser Ser Glu Thr Gly Glu Met Gly 
545 5 50 555 560 

He Leu Trp Glu Phe Asp Pro He He Asn Lys Trp He Arg Leu Ser 
56 5 570 575 

Met Lys Leu Lys Val Glu Arg Lys Pro Phe Ala Glu Gly Ala Leu Arg 
580 585 590 

Glu Ala Tyr His Thr Val Ser Leu Gly Val Gly Thr Asp Glu Asn Tyr 
595 600 605 

Pro Leu Gly Thr Thr Thr-.Lys Leu Phe Pro Pro He Glu Met He Ser 
610 615 . 620 

Pro He Ser Lys Asn Asn Glu Ala Met Thr Gin Leu Lys Asn Gly Thr 
625 630 635 640 

Lys Phe Val Leu Lys Leu Tyr Lys Lys Glu Ala Glu Gin Gin Ala Ser 
645 650 655 

Arg Glu Leu Tyr Phe Glu Asp Val Lys Met Gin Met Val Cys Arg Asp 
660 665 670 



Trp Gly Asn Lys Phe Asn Gin Lys Lys Pro Pro Lys Lys He Glu Phe 
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675 680 685 

Leu Met Ser Trp Val Val Glu Leu lie Asp Arg Ser Pro Ser Ser Asn 
690 695 700 



Gly Gin Pro lie 
705 

Lys Lys Asn Asn 

Pro Gin Ala Phe 
740 

lie Val Val Asp 
755 

lie His Thr Pro 
770 

Ala Gly lie Asn 
785 



Leu Cys Ser lie 
710 

Ser Asn Tyr Gly 
725 

Ser His Phe Thr 

lie Gin Gly Val 
760 

Asp Gly Lys Gly 
775 

Lys Phe lie Thr 
790 



Glu Pro Leu Leu 
715 

Ala Val Leu Thr 
730 

Tyr Glu Leu Ser 
745 

Asp Asp Leu Tyr 

Phe Gly Leu Gly 
780 

Thr His Lys Cys 
795 



Val Gly Glu Phe 
720 

Asn Arg Ser Thr 
735 

Asn Lys Gin Met 
750 

Thr Asp Pro Gin 
765 

Asn Leu Gly Lys 

Asn Ala Val Cys 
800 



Ala Leu Leu Asp Leu Asp Val Lys Leu Gly Gly Val Leu Ser Gly Asn 
805 810 815 

Asn Lys Lys Gin Leu Gin Gin Gly Thr Met Val Met Pro Asp lie Leu 
820 825 830 



Pro Glu Leu Met Pro Ser Asp Asn Thr lie Lys Val Gly Ala Lys Gin 
835 840 845 

Leu Pro Lys Ala Glu Phe Ser Lys Lys Asp Leu Lys Cys Val Ser Thr 
850 855 860 

lie Gin Ser Phe Arg Glu Arg Val Asn Ser lie Ala Phe Phe Asp Asn 
865 870 875 880 
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Gin Lys Leu Leu Cys Ala Gly Tyr Gly Asp Gly Thr Tyr Arg Val Phe 
885 890 895 

Asp Val Asn Asp Asn Trp Lys Cys Leu Tyr Thr Val Asn Gly His Arg 



900 905 



910 



Lys Ser He Glu Ser He Ala Cys Asn Ser Asn Tyr He Phe Thr Ser 



91 5 920 



925 



Ser Pro Asp Asn Thr He Lys Val His lie He Arg Ser Gly Asn Thr 
930 935 940 

Lys Cys He Glu Thr Leu Val Gly His Thr Gly Glu Val Asn Cys Val 



945 



950 



955 



960 



Val Ala Asn Glu Lys Tyr Leu Phe Ser Cys Ser Tyr Asp Lys Thr lie 



965 970 975 



Lys Val Trp Asp Leu Ser Thr Phe Lys Glu He Lys Ser Phe Glu Gly 
980 985 990 



Val His Thr Lys Tyr He Lys Thr Leu Ala Leu Ser Gly Arg Tyr 
995 1000 100 5 



Phe Ser Gly Gly Asn Asp-.Gln He He Tyr Val ^Trp Asp Thr Glu Thr 
1010 1015 1020 

Leu Ser Met Leu Phe Asn Met Gin Gly His Glu Asp Trp Val Leu Ser 

- X040 

Leu His Cys Thr Ala Ser Tyr Leu Phe Ser Thr Ser Lys Asp Asn Val 
1045 1050 1055 

He Lys He Trp Asp Leu Ser Asn Phe Ser Cys He Asp Thr Leu Lys 
1060 1065 1070 



Gly His Trp Asn Ser Val Ser Ser Cys Val Val Lys Asp Arg Tyr 



Leu 
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107 5 1080 1085 

Tyr Ser Gly Ser Glu Asp Asn Ser He Lys Val Trp Asp Leu Asp Thr 

1095 iioo 

Leu Glu Cys Val Tyr Thr He Pro Lys Ser His Ser Leu Gly Val Lys 
1105 mo H15 1120 

Cys Leu Met Val Phe Asn Asn Gin He He Ser Ala Ala Phe Asp Gly 
1125 H30 1135 

Ser He Lys Val Trp Glu Trp Gin, Ser Lys 
1140 H45 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2237 base pairs 

(B) TYPE: nucleic acid 

(C) STRAWDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dictyos tel ium discoideum * 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
ATAAGAAGAT AGAAGATGAT ATTTAAAGTT TGGTTTTCAT ATGAAGATGA GGAAGTGGAA 
CTATCAGAAT TAACAAATGA TACAACAGTG TCAGCAATTA GAAAGATCTT ACATGAAGGT 1 
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AAAATATTTA GATTTCCATA TGGTACATCT CAAACAGACT TGCAAATTGG AAAGATGTTA 
CCATCTGGTA GTGGTGGAGG TGCAACTGCA GACAGCAAAT TTGAGAAGTT TAAAGCACGT 
5 AATACATTAG CAGATATTCA ATATAAAGTT GGTGATACAT TATATGTTAG AGTTAAAAAA 
AGTAAACCAA CAAATGATTC ATTATTACCA ACATTAAATA TAGCATTTTT AGATGGATCA 
GAACGTGCAA TTAAATGGGA ATATGACCCA TATACTACAA CTGCTCAATG GACCTGTACA 



180 



240 



300 



360 



420 



10 

GCAACATTAG TCAAAGTTGA ACCAGTACCA TTTGCTGAAG GTGCATTTAG GAAAGCTTAT 480 

CATACATTGG ATTTAAGTAA ATCTGGTGCA AGTGGAAGAT ATGTATCAAA GATTGGTAAA 54 0 

15 AAACCAACAC C AAGAC CATC ATATTTTGAA GATGTAAAGA TGCAAATGAT AGCAAAGAAA 600 

TGGGCAGATA AATATAATTC ATTTAAACCT CCAAAAAAGA TTGAATTTTT ACAATCATGC 660 

GTTTTAGAGT TTGTAGATAG AACATCATCA GATTTAATTT GTGGAGCAGA ACCATATGTA 72 0 

20 

GAAGGACAAT ATAGAAAGTA TAATAATAAT AGTGGATTCG TTAGTAATGA TGAAAGAAAT 78 0 

ACACCACAAT CATTCTCTCA TTTCACATAT GAACATTCAA ATCATCAATT ATTGATTATA 84 0 

25 GATATTCAAG GTGTTGGTGA TC AC TATAC A GACCCACAAA TTCATACCTA TGATGGTGTT 90 0 

GGTTTTGGTA TTGGTAATTT GGGTCAAAAA GGTTTTGAAA AGTTTTTAGA TACTCATAAA 96 0 

TGTAATGCAA TTTGCCAATA TTTAAATTTA CAATCAATTA ATCCAAAATC TGAAAAAAGT 102 0 

30 

GATTGTGGTA CTGTACCAAG ACCAGATTTA ATTTTCCCTG ATACATCTGA AAGAGATAAT 108 0 

AATAATAATA ATAATAATAA TAATAATAAT AATAATAATA ATAATAATAA TAATAGTAAT 114 0 

35 AATAATAATA ATAACAATAG TAGTATTTCA AAATCATTAG TTGAAATTTC AAGTGGTAGT 1200 

AAAGAAAGAA ATGATAGAGA TTCGCCAAGT AGACAATTAT TTGTTTCAAA TGATGGTAAT 1260 
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(A) LENGTH: 732 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dictyostelium discoideum 



(xi) SEQUENCE DESCRIPTION: SEQ ID 



NO: 8: 



Met He Phe Ly S Val Trp Phe Se r Tyr Glu Asp Glu Glu 



Val Glu Leu 
10 15 



20 

His Glu Gly Ly S He Phe Arg Phe Pro Tyr Qly Thr ^ ^ ^ 

65 70 

/b 80 

100 
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Asp Gly Ser Glu Arg Ala lie Lys Trp Glu Tyr Asp Pro Tyr Thr Thr 
115 120 125 

Thr Ala Gin Trp Thr Cys Thr Ala Thr Leu Val Lys Val Glu Pro Val 
130 135 140 

Pro Phe Ala Glu Gly Ala Phe Arg Lys Ala Tyr His Thr Leu Asp Leu 
145 - 150 155 160 

Ser Lys Ser Gly Ala Ser Gly Arg Tyr Val Ser Lys lie Gly Lys Lys 
• 165 170 175 

Pro Thr Pro Arg Pro Ser Tyr Phe Glu Asp Val Lys Met Gin Met lie 
180 185 190 

Ala Lys Lys Trp Ala Asp Lys Tyr Asn Ser Phe Lys Pro Pro Lys Lys 
195 200 205 

lie Glu Phe Leu Gin Ser Cys Val Leu Glu Phe Val Asp Arg Thr Ser 
210 215 220 

Ser Asp Leu lie Cys Gly Ala Glu Pro Tyr Val Glu Gly Gin Tyr Arg 
225 230 235 240 

Lys Tyr Asn Asn Asn Ser-Gly Phe Val Ser Asn Asp Glu Arg Asn Thr 
245 250 255 

Pro Gin Ser Phe Ser His Phe Thr Tyr Glu His Ser Asn His Gin Leu 
260 265 270 

Leu lie lie Asp lie Gin Gly Val Gly Asp His Tyr Thr Asp Pro Gin 
275 280 285 

lie His Thr Tyr Asp Gly Val Gly Phe Gly lie Gly Asn Leu Gly Gin 
290 295 300 



Lys Gly Phe Glu Lys Phe Leu Asp Thr His Lys Cys Asn Ala lie Cys 



85 

305 310 315 

J15 320 



Gin Tyr Leu Asn Leu Gin Ser He Asn Pro Lys Ser Glu 



325 



330 



Lys Ser Asp 
3 35 



Cys oiy Thr val Pro Arg Pro Asp Leu Ile phe prQ Asp ^ ^ 



340 



345 



350 



Arg Asp Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn 



355 



360 



365 



Asn Asn Asn Asn Asn Ser Asn Asn Asn Asn Asn Asn Asn Ser Ser He 



370 



375 



380 

Ser Lys Ser Leu Val Glu He Ser 



385 



Ser Gly Ser Lys Glu Arg Asn Asp 
395 400 

Arg Asp Ser Pro Ser Arg Gin Leu Phe Val Ser Asn Asp Gly Asn Thr 



405 



410 



415 



Leu Asn Thr Asn Lys Glu Arg Ser Lys Ser Lys Ser lie Asp Leu Glu 
420 425 430 

Lys Pro Glu lie Leu He Asn Asn Lys Lys Lys Glu Ser lie Asn Leu 
Glu Thr lie Lys Leu He Glu Thr lie Lys Gly Tyr His Val Thr Ser 
His Leu Cys lie Cys Asp Asn Leu Leu Phe Thr Gly Cys Ser Asp Asn 



465 



470 



475 



480 



Ser He Arg Val Tyr Asp Tyr Lys Ser Gin Asn Met Glu Cys Val Gin 



485 



490 



495 



Thr Leu Lys Gly His Glu Gly Pro Val Glu Ser lie Cys Tyr Asn Asp 



500 



505 



510 



86 

Gin Tyr Leu Phe Ser Gly Ser Ser Asp His Ser lie Lys Val Trp Asp 
515 520 525 

Leu Lys Lys Leu Arg Cys lie Phe Thr Leu Glu Gly His Asp Lys Pro 
530 535 540 

Val His Thr Val Leu Leu Asn Asp Lys Tyr Leu Phe Ser Gly Ser Ser 
545 - 550 555 560 

Asp Lys Thr lie Lys Val Trp Asp Leu Lys Thr Leu Glu Cys Lys Tyr 
565 570 575 

Thr Leu Glu Ser His Ala Arg Ala Val Lys Thr Leu Cys lie Ser Gly 
580 585 590 

Gin Tyr Leu Phe Ser Gly Ser Asn Asp Lys Thr lie Lys Val Trp Asp 
595 600 605 

Leu Lys Thr Phe Arg Cys Asn Tyr Thr Leu Lys Gly His Thr Lys Trp 
610 615 620 

Val Thr Thr lie Cys lie Leu Gly Thr Asn Leu Tyr Ser Gly Ser Tyr 
625 630 635 640 

Asp Lys Thr lie Arg Val-Trp Asn Leu Lys Ser Leu Glu Cys Ser Ala 
645 650 655 

Thr Leu Arg Gly His Asp Arg Trp Val Glu His Met Val lie Cys Asp 
660 665 670 

Lys Leu Leu Phe Thr Ala Ser Asp Asp Asn Thr lie Lys lie Trp Asp 
675 680 685 

Leu Glu Thr Leu Arg Cys Asn Thr Thr Leu Glu Gly His Asn Ala Thr 
690 695 700 



Val Gin Cys Leu Ala Val Trp Glu Asp Lys Lys Cys Val lie Ser Cys 
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705 



710 



715 



720 



y. 3 



U1 
hi- 



10 



15 



20 



Ser His Asp Gin Ser lie Arg Val Trp Gly Trp Asn 
725 730 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2307 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. elegans 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



25 ATGACG AT CG ACACAACAAA TGAGAGCGAC AATAGTCCAA CTAACTCACC 



AGGATTGGAG 



60 



30 



GCCTCGGCTC GGACATTCTC GCTCAATGCG TCAAAAATGG TTCGGATAAC CGACGACTAC 
GCAGATGAAG TGTTCATTGA ACAGAATGAT GTCGTTATCG AGAAGCCTCG TATGGATCCT 
CTCCACGTTA GAAAACTTAT GGAGACATGG CGCAAGGCTG CTCGCCGAGC AAGAACAAAC 



12 0 



180 



240 



TATATAGATC CATGGGATGA GTTCAACATC CACGAGTATC CAGTACAACG AGCTAAACGA 
35 TATAGGTATT CTGCAATCAG AAAGCAATGG ACAGAGGATA TAGTCGATGT GAGACTTCAT 



300 



360 



CCGGACAGTT TTGCACGTGG AGCCATGCGA GAATGCTACC GACTCAAAAA GTGCTCCAAG 420 



CACGGAACAA GTCAAGATTG 

GATCGTAGAG TTCTTTTCGA 

GAATATAATC GGTATAATCC 

ATGATTGATG TAAAAGGTTC 

ATAAAATACA ATTCAAACTC 

TTTTCTCACT TCACCTTCGA 

GTTGGTG AT C TTTACACAGA 

GGAAACCTCG GAACTCGTGG 

TGTGAGACAA TGGATCTATC 

GTTGCGATGG AAGTAGCTGC 

GAAGCAAGAA GAAATCGAAT 

GATCAATTGA GAAAAAGGAA 

CACAACGAAG ACTGTGTATG 

TGCTCCGAAG ATGAAGAGGA 

AGTCAGAAAA GTCGACGTAG 

GCATCTCGTC CTAGAAAATG 

AGCTTCAGAA GTTCTGTTGG 

GAAAAGGATG AATTCTGGAA 

CTTCAACTTC AACAAATGGC 
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GAGCAGCAAC TATGTCGCAA 
TGATGTCAGA CTTCAGATGG 
ACCGAAGAAA ATTGATATTG 
TCCACTCTAT CATTTGGAGC 
AGGATTTGTA TCAAATGCAG 
ACGTTCTGGT CATCAAATGA 
TCCTCAGATT CATACAGTTG 
AATGGCTCTT TTCTTCCATT 
AAATTTCGAA CTTTCGCCAC 
AAAGCAGAAA AAGTCATGCA 
TTCAAGTGAA TGTGTACATG 
GACGTTGAAT CAATCGTCAA 
TCCTGAGTGT ATTCCAGTTG 
CGAAGAAGAA GACTATCCAA 
TAGAATGAGC ATTTCAACGA 
CGGATTTGTA GATTTAAACT 
GACATATTCT ATGAATAGTT 
GGTTCTTCGA AAACAATCAG 
TGCTAACCTG GAAAATGATG 



AAAGATACAT TTGTCAAGTC 

ATGCCAAATT ATGGGCTGAA 

TTCAAATGTG TGTCATTGAG 

ATTTCATCGA GGGAAAATAT 

CTCGTCTTAC ACCACAAGCA 

TGGTTGTCGA TATTCAAGGA 

TGGGAACTGA TTATGGAGAT 

CACACAGATG TAACGATATT 

CTGAAATCGA GGCTACCGAA 

TAGTTCCTCC AACTGTGTTC 

TCGAGCATGG TATTTCGATG 

CCGATTTGTC AG C AAAG AG T 

TTGAGCAACT CTGTGAGCCT 

GAAGTGAAAA AAGTGGAAAT 

GATCTTCTGG CGATGAATCA 

CACTTCGTCA GAGACATGAT 

CTAGACAAAC CAGAGACACT 

TTCCAGCAAA CATTCTATCA 

AAGACGTACC AC AAGTC AC C 



— 89 ^ 

GGGCATCAGT TCTCTGTCCT CGGTCAGATT CATATTGATC TCTCACGATA TCATGAGCTC 
GGGCGGTTCG TAGAAGTTGA TTCAGAACAT AAGGAAATGC TTGAGGGAAG TGAAAATGAC 
GCTCGTGTAC CAATCAAATA CGACAAGCAG TCTGCAATTT TCCATTTGGA TATCGCTCGG 
AAGTGTGGAA TCCTTGAGGC TGTGCTAACA TCGGCTCATA TTGTTCTCGG ATTACCACAT 
GAATTGTTGA AAGAAGTCAC CGTTGATGAT CTGTTTCCTA ATGGGTTTGG AGAACAGGAA 
AATGGAATTC GAGCTGATAA AGGACAAAAA CCTTGTGACC TAGAAGAGTT CGGCTCCGAT 
CTGATGGAAA TTGCTGCAGA GATGGGTGAT AAGGGTGCAA TGCTGTACAT GGCACACGCT 
TATGAAACTG GTCAGCATCT CGGACCGAAT CGAAGAACGG ATTATAAGAA ATCGATTGAT 
TGGTATCAAC GCGTCGTTGG ATTCCAAGAA GAAGAAGAAC TTGACTCTGA TTGTGGAAAA 
AC G AC ATTCT CCTCATTTGC TCCACTGACT CGTCACGAGA TTCTAGCCAA AATGGCTGAA 
ATGTACAAAG AGGGAGGTTA TGGCCTGAAT CAAGACTTCG AACGAGCATA TGGTCTATTC 
AATGAAGCTG CTGAAGCAGC AATGGAAGCA ATGAATGGAA AGCTCGCAAA TAAATACTAT 
GAAAAAGCGG AAATGTGTGG AGAATGA. 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 768 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. elegans 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Met Thr lie Asp 
1 

Pro Gly Leu Glu 
20 

Met Val Arg lie 
35 

Asn Asp Val Val 
50 

Lys Leu Met Glu 
65 

Tyr lie Asp Pro 

Arg Ala Lys Arg 
100 

Asp lie Val Asp 
115 

Met Arg Glu Cys 
130 

Gin Asp Trp Ser 
145 



Thr Thr Asn Glu 
5 

Ala Ser Ala Arg 

Thr Asp Asp Tyr 
40 

lie Glu Lys Pro 
55 

Thr Trp Arg Lys 
70 

Trp Asp Glu Phe 
85 

Tyr Arg Tyr Ser 

Val Arg Leu His 
120 

Tyr Arg Leu Lys 
135 

Ser Asn Tyr Val 
150 



Ser Asp Asn Ser 
10 

Thr Phe Ser Leu 
25 

Ala Asp Glu Val 

Arg Met Asp Pro 
60 

Ala Ala Arg Arg 
75 

Asn lie His Glu 
90 

Ala lie Arg Lys 
105 

Pro Asp Ser Phe 



Lys Cys Ser Lys 
140 

Ala Lys Arg Tyr 
155 



Pro Thr Asn Ser 
15 

Asn Ala Ser Lys 
30 

Phe lie Glu Gin 
45 

Leu His Val Arg 

Ala Arg Thr Asn 
80 

Tyr Pro Val Gin 
95 

Gin Trp Thr Glu 
110 

Ala Arg Gly Ala 
125 

His Gly Thr Ser 

lie Cys Gin Val 
160 



Asp Arg Arg Val Leu Phe Asp Asp Val Arg Leu Gin Met Asp Ala Lys 
16 5 170 175 

Leu Trp Ala Glu Glu Tyr Asn Arg Tyr Asn Pro Pro Lys Lys He Asp 
180 185 i9o 

He Val Gin Met Cys Val He Glu Met He Asp Val Lys Gly Ser. Pro 
195 200 205 

Leu Tyr His Leu Glu His Phe He Glu Gly Lys Tyr He Lys Tyr Asn 
210 215 220 

Ser Asn Ser Gly Phe Val Ser Asn Ala Ala Arg Leu Thr Pro Gin Ala 
225 230 235 240 

Phe Ser His Phe Thr Phe Glu Arg Ser Gly His Gin Met Met Val Val 
245 250 255 

Asp He Gin Gly Val Gly Asp Leu Tyr Thr Asp Pro Gin He His Thr 
260 265 270 

Val Val Gly Thr Asp Tyr Gly Asp Gly Asn Leu Gly Thr Arg Gly Met 
275 280 285 

Ala Leu Phe Phe His Ser.jHis Arg Cys Asn Asp He Cys Glu Thr Met 
290 295 300 

Asp Leu Ser Asn Phe Glu Leu Ser Pro Pro Glu He Glu Ala Thr Glu 
305 310 315 320 

Val Ala Met Glu Val Ala Ala Lys Gin Lys Lys Ser Cys He Val Pro 
32 5 330 335 

Pro Thr Val Phe Glu Ala Arg Arg Asn Arg He Ser Ser Glu Cys Val 
340 345 350 



His Val Glu His Gly He Ser Met Asp Gin Leu Arg Lys 



Arg Lys Thr 



# 

355 

Leu Asn Gin 
370 

Cys Val Cys 
385 

Cys Ser Glu 

Lys Ser Gly 

Thr Arg Ser 
435 

Phe Val Asp 
450 

Ser Val Gly 
465 

Glu Lys Asp 
Asn lie Leu 



Asp Glu Asp 
515 

Gin lie His 
530 



Ser Ser Thr 

Pro Glu Cys 
390 

Asp Glu Glu 
405 

Asn Ser Gin 
420 

Ser Gly Asp 

Leu Asn Ser 

Thr Tyr Ser 
470 

Glu Phe Trp Lys 
485 

Ser Leu Gin Leu 
500 

Val Pro Gin Val 

lie Asp Leu Ser 
535 
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360 

Asp Leu Ser Ala 
375 

lie Pro Val Val 

Asp Glu Glu Glu 
410 

Lys Ser Arg Arg 
425 

Glu Ser Ala Ser 
440 

Leu Arg Gin Arg 
455 

Met Asn Ser Ser 



Val Leu Arg 
490 

Gin Gin Met 
505 

Thr Gly His 
520 

Arg Tyr His 




365 

Lys Ser His 
380 

Glu Gin Leu 
395 

Asp Tyr Pro 

Ser Arg Met 

Arg Pro Arg 
445 

His Asp Ser 
460 

Arg Gin Thr 
475 

Lys Gin Ser 
Ala Ala Asn 



Gin Phe Ser 
525 

Glu Leu Gly 
540 



Asn Glu Asp 

Cys Glu Pro 
400 

Arg Ser Glu 
415 

Ser lie Ser 
430 

Lys Cys Gly 

Phe Arg Ser 

Arg Asp Thr 
480 

Val Pro Ala 
495 

Leu Glu Asn 
510 

Val Leu Gly 
Arg Phe Val 



Glu Val Asp Ser Glu His Lys Glu Met Leu Glu Gly Ser Glu Asn Asp 
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Ala Arg Val Pro 

Asp lie Ala Arg 
580 

His lie Val Leu 

Asp Asp Leu Phe 
610 

Ala Asp Lys Gly 
625 

Leu Met Glu lie 

Met Ala His Ala 
660 



He Lys Tyr Asp 
565 

Lys Cys Gly He 

Gly Leu Pro His 
600 

Pro Asn Gly Phe 
615 

Gin Lys Pro Cys 
630 

Ala Ala Glu Met 
645 

Tyr Glu Thr Gly 



Lys Gin Ser Ala 
570 

Leu Glu Ala Val 
585 

Glu Leu Leu Lys 

Gly Glu Gin Glu 
620 

Asp Leu Glu Glu 
635 

Gly Asp Lys Gly 
650 

Gin His Leu Gly 
665 



He Phe His Leu 
575 

Leu Thr Ser Ala 
590 

Glu Val Thr Val 
605 

Asn Gly lie Arg 

Phe Gly Ser Asp 
640 

Ala Met Leu Tyr 
655 

Pro Asn Arg Arg 
670 



Thr Asp Tyr Lys 
675 

Gin Glu Glu Glu 
690 

Ser Phe Ala Pro 
705 

Met Tyr Lys Glu 

Tyr Gly Leu Phe 
740 

Gly Lys Leu Ala 



Lys Ser He Asp 
680 

Glu Leu -Asp Ser 
695 

Leu Thr Arg His 
710 

Gly Gly Tyr Gly 
725 

Asn Glu Ala Ala 
Asn Lys Tyr Tyr 



Trp Tyr Gin Arg 

Asp Cys Gly Lys 
700 

Glu He Leu Ala 
715 

Leu Asn Gin Asp 
730 

Glu Ala Ala Met 
745 

Glu Lys Ala Glu 



Val Val Gly Phe 
685 

Thr Thr Phe Ser 

Lys Met Ala Glu 
720 

Phe Glu Arg Ala 
735 

Glu Ala Met Asn 
750 

Met Cys Gly Glu 
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7 55 760 765 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2283 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. elegans 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

) 

ATGACGATCG ACACAACAAA TGAGAGCGAC AATAGTCCAA CTAACTCACC AGGATTGGAG 
GCCTCGGCTC GGACATTCTC GCTCAATGCG TCAAAAATGG TTCGGATAAC CGACGACTAC 
GCAGATGAAG TGTTCATTGA ACAGAATGAT GTCGTTATCG AGAAGCCTCG TATGGATCCT 
CTCCACGTTA GAAAACTTAT GGAGACATGG CGCAAGGCTG CTCGCCGAGC AAGAACAAAC 
TATATAGATC CATGGGATGA GTTCAACATC CACGAGTATC CAGTACAACG AGCTAAACGA 
TATAGGTATT CTGCAATCAG AAAGCAATGG ACAGAGGATA TAGTCGATGT GAGACTTCAT 
CCGGACAGTT TTGCACGTGG AGCCATGCGA GAATGCTACC GACTCAAAAA GTGCTCCAAG 
CACGGAACAA GTCAAGATTG GAGCAGCAAC TATGTCGCAA AAAGATACAT TTGTCAAGTC 



95 

GATCGTAGAG TTCTTTTCGA TGATGTCAGA CTTCAGATGG ATGCCAAATT ATGGGCTGAA 



10 

GTTGGTGATC TTTACACAGA TCCTCAGATT CATACAGTTG TGGGAACTGA TTATGGAGAT 



540 



600 



GAATATAATC GGTATAATCC ACCGAAGAAA ATTGATATTG TTCAAATGTG TGTCATTGAG 
5 ATGATTGATG TAAAAGGTTC TCCACTCTAT CATTTGGAGC ATTTCATCGA GGGAAAATAT 660 
ATAAAATACA ATTCAAACTC AGGATTTGTA TCAAATGCAG CTCGTCTTAC ACCACAAGCA 720 
TTTTCTCACT TCACCTTCGA ACGTTCTGGT CATCAAATGA TGGTTGTCGA TATTCAAGGA 



780 



840 



GGAAACCTCG GAACTCGTGG AATGGCTCTT TTCTTCCATT CACACAGATG TAACGATATT 900 
15 TGTGAGACAA TGGATCTATC AAATTTCGAA CTTTCGCCAC CTGAAATCGA GGCTACCGAA 960 
GTTGCGATGG AAGTAGCTGC AAAGCAGAAA AAGTCATGCA TAGTTCCTCC AACTGTGTTC 1020 
GAAGCAAGAA GAAATCGAAT TTCAAGTGAA TGTGTACATG TCGAGCATGG TATTTCGATG 

20 

GATCAATTGA GAAAAAGGAA GACGTTGAAT CAATCGTCAA CCGATTTGTC AGCAAAGAGT 



1080 



1140 



1200 



CACAACGAAG ACTGTGTATG TCCTGAGTGT ATTCCAGTTG TTGAGCAACT CTGTGAGCCT 
25 TGCTCCGAAG ATGAAGAGGA CGAAGAAGAA GACTATCCAA GAAGTGAAAA AAGTGGAAAT 126 0 

AGTCAGAAAA GTCGACGTAG TAGAATGAGC ATTTCAACGA GATCTTCTGG CGATGAATCA 1320 
GCATCTCGTC CTAGAAAATG CGGATTTGTA GATTTAAACT CACTTCGTCA GAGACATGAT 

30 

AGCTTCAGAA GTTCTGTTGG GACATATTCT ATGAATAGTT CTAGACAAAC CAGAGACACT 



1380 



1440 



GAAAAGGATG AATTCTGGAA GGTTCTTCGA AAACAATCAG TTCCAGCAAA CATTCTATCA 
35 CTTCAACTTC AACAAATGGC TGCTAACCTG GAAAATGATG AAGACGTACC ACAAGTCACC 

GGGCATCAGT TCTCTGTCCT CGGTCAGATT CATATTGATC TCTCACGATA TCATGAGCTC 1620 



1500 



1560 
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GGGCGGTTCG TAGAAGTTGA TTCAGAACAT AAGGAAATGC TTGAGGGAAG TGAAAATGAC 
GCTCGTGTAC CAATCAAATA CGACAAGCAG TCTGCAATTT TCCATTTGGA TATCGCTCGG 
AAGTGTGGAA TCCTTGAGGC TGTGCTAACA TCGGCTCATA TTGTTCTCGG ATTACCACAT 
GAATTGTTGA AAGAAGTCAC CGTTGATGAT CTGTTTCCTA ATGGGTTTGG AGAACAGGAA 
AATGGAATTC GAGACCTAGA AGAGTTCGGC TCCGATCTGA TGGAAATTGC TGCAGAGATG 
GGTGATAAGG GTGCAATGCT GTACATGGCA CACGCTTATG AAACTGGTCA GCATCTCGGA 
CCGAATCGAA GAACGGATTA TAAGAAATCG ATTGATTGGT ATCAACGCGT CGTTGGATTC 
CAAGAAGAAG AAGAACTTGA CTCTGATTGT GGAAAAACGA CATTCTCCTC ATTTGCTCCA 
CTGACTCGTC ACGAGATTCT AG C CAAAATG GCTGAAATGT ACAAAGAGGG AGGTTATGGC 
CTGAATCAAG ACTTCGAACG AGCATATGGT CTATTCAATG AAGCTGCTGA AGCAGCAATG 
GAAGCAATGA ATGGAAAGCT CGCAAATAAA TACTATGAAA AAGCGGAAAT GTGTGGAGAA 
TGA 

(2) INFORMATION FOR SEQ ID N0:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 760 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) .HYPOTHETICAL: NO 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: C. elegans 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Thr lie Asp Thr Thr Asn Glu Ser Asp Asn Ser Pro Thr Asn Ser 
1-5 io 15 

Pro Gly Leu Glu Ala Ser Ala Arg Thr Phe Ser Leu Asn Ala Ser Lys 
20 25 30 

Met Val Arg lie Thr Asp Asp Tyr Ala Asp Glu Val Phe lie Glu Gin 
35 40 45 

Asn Asp Val Val lie Glu Lys Pro Arg Met Asp Pro Leu His Val Arg 
50 ' 55 60 

Lys Leu Met Glu Thr Trp Arg Lys Ala Ala Arg Arg Ala Arg Thr Asn 
65 70 75 80 

Tyr lie Asp Pro Trp Asp Glu Phe Asn lie His Glu Tyr Pro Val Gin 
85 90 95 

Arg Ala Lys Arg Tyr Arg -Tyr Ser Ala lie Arg Lys Gin Trp Thr Glu 
100 105 110 

Asp lie Val Asp Val Arg Leu His Pro Asp Ser Phe Ala Arg Gly Ala 
115 120 125 

Met Arg Glu Cys Tyr Arg Leu Lys Lys Cys Ser Lys His Gly Thr Ser 
130 135 140 

Gin Asp Trp Ser Ser Asn Tyr Val Ala Lys Arg Tyr He Cys Gin Val 
145 150 155 160 

Asp Arg Arg Val Leu Phe Asp Asp Val Arg Leu Gin Met Asp Ala Lys 
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165 170 175 

Leu Trp Ala Glu Glu Tyr Asn Arg Tyr Asn Pro Pro Lys Lys lie Asp 
180 185 190 

lie Val Gin Met Cys Val lie Glu Met lie Asp Val Lys Gly Ser Pro 
195 200 205 

Leu Tyr His Leu Glu His Phe lie Glu Gly Lys Tyr lie Lys Tyr Asn 
210 215 220 

Ser Asn Ser Gly Phe Val Ser Asn Ala Ala Arg Leu Thr Pro Gin Ala 
225 230 235 240 

Phe Ser His Phe Thr Phe Glu Arg Ser Gly His Gin Met Met Val Val 
245 250 255 

Asp lie Gin Gly Val Gly Asp Leu Tyr Thr Asp Pro Gin lie His Thr 
260 265 270 

Val Val Gly Thr Asp Tyr Gly Asp Gly Asn Leu Gly Thr Arg Gly Met 
275 280 285 

Ala Leu Phe Phe His Ser His Arg Cys Asn Asp lie Cys Glu Thr Met 
290 -295 300 

Asp Leu Ser Asn Phe Glu Leu Ser Pro Pro Glu lie Glu Ala Thr Glu 
305 310 315 320 

Val Ala Met Glu Val Ala Ala Lys Gin Lys Lys Ser Cys lie Val Pro 
325 330 335 

Pro Thr Val Phe Glu Ala Arg Arg Asn Arg He Ser Ser Glu Cys Val 
340 345 350 

His Val Glu His Gly He Ser Met Asp Gin Leu Arg Lys Arg Lys Thr 
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Leu Asn Gin Ser Ser Thr Asp Leu Ser Ala Lys Ser His Asn Glu Asp 



370 375 



380 



Cys Val Cys Pro Glu Cys He Pro Val Val Glu Gin Leu Cys Glu Pro 

385 390 395 

J " 400 



Cys Ser Glu Asp Glu Glu Asp Glu Glu Glu 



405 410 



Asp Tyr Pro Arg Ser Glu 
415 



Lys Ser Gly Asn Ser Gin Lys Ser Arg Arg Ser Arg Met Ser He Ser 
420 425 430 

Thr Arg Ser Ser Gly Asp Glu Ser Ala Ser Arg Pro Arg Lys Cys Gly 



435 440 



445 



Phe Val Asp. Leu Asn Ser Leu Arg Gin Arg His Asp Ser Phe Arg Ser 

460 



450 455 



Ser Val Gly Thr Tyr Ser Met Asn 
465 47 0 



Ser Ser Arg Gin Thr Arg Asp Thr 
475 480 



Pro Ala 



Glu Lys Asp Glu Phe Trp Lys Val Leu Arg Lys Gin Ser Val 

485 490 495 

Asn He Leu Ser Leu Gin-Leu Gin Gin Met Ala Ala Asn Leu Glu Asn 
500 505 siO 

Asp Glu Asp Val Pro Gin Val Thr Gly His Gin Phe Ser Val Leu Gly 
515 520 525 



Gin He His He Asp Leu Ser Arg Tyr His Glu Leu Gly Arg Phe Val 

540 



530 535 



Glu Val Asp Ser Glu His Lys Glu Met Leu Glu Gly Ser Glu Asn Asp 
545 550 555 560 



a Arg Val Pro He Lys Tyr Asp Lys Gin Ser Ala He Phe His Leu 
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Asp lie Ala Arg 
580 

His lie Val Leu 
595 

Asp Asp Leu Phe 
610 

Asp Leu Glu Glu 
625 

Gly Asp Lys Gly 

Gin His Leu Gly 
660 

Trp Tyr Gin Arg 
675 



565 

Lys Cys Gly He 

Gly Leu Pro His 
600 

Pro Asn Gly Phe 
615 

Phe Gly Ser Asp 
630 

Ala Met Leu Tyr 
645 

Pro Asn Arg Arg 

Val Val Gly Phe 
680 



570 

Leu Glu Ala Val 
585 

Glu Leu Leu Lys 

Gly Glu Gin Glu 
620 

Leu Met Glu He 
635 

Met Ala His Ala 
650 

Thr Asp Tyr Lys 
665 

Gin Glu Glu Glu 



575 

Leu Thr Ser Ala 
590 

Glu Val Thr Val 
605 

Asn Gly He Arg 

Ala Ala Glu Met 
640 

Tyr Glu Thr Gly 
655 

Lys Ser He Asp 
670 

Glu Leu Asp Ser 
685 



Asp Cys Gly Lys Thr Thr Phe Ser Ser Phe Ala Pro Leu Thr Arg His 

690 -.695 700 

Glu He Leu Ala Lys Met Ala Glu Met Tyr Lys Glu Gly Gly Tyr Gly 

705 710 715 720 



Leu Asn Gin Asp Phe Glu Arg Ala Tyr Gly Leu Phe Asn Glu Ala Ala 
725 730 735 

Glu Ala Ala Met Glu Ala Met Asn Gly Lys Leu Ala Asn Lys Tyr Tyr 
740 745 750 

Glu Lys Ala Glu Met Cys Gly Glu 




101 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 628 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE : cDNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dictyostel ium discoideum 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
20 GTATTGTATG TGTTTCAATT GAGAAGACTC CATTTGCAAA GGGTAGTTGT AGAACAGCAC 60 

ATAAATTAAA GGATTGGAGT CAACCAGATC AAGGATTAGT TGGTAAATTT TCAACCAATA 12 0 

AAAAGACGAC AAG AG AT TC A T AC TTT AC AG ATGTATTGAT GCAAACATTT TGTGCTAAAT 180 

25 

GGGCAGAGAA ATTCAATGAA G CGAAACC AC CAAAACCAAT TACATTCTTA CCATCATACG 24 0 

TTTACGAATT GATTGATCAT CCACCACCTT ATCCAGTTTG TGGTGGTGAA CCATTCATTG 3 00 

30 AGGGAGATTA CAAGAAACAT AACAACAACA GTGGTTACGT TAGTAGTGAT GCTAGAAATA 360 ' 

CACCACAATC ATTCTCTCAT TTCTCATACG AACTCTCCAA TCATGAATTG TTGATCGTTG 420 

ATATCCAAGG TGTCAACGAT TTCTACACTG ATCCTCAAAT TCATACGAAA TCAGGCGAGG 480 

35 

GCTTTGGCGA GGGTAATTTG GGCGAGACGG GTTTCCACAA ATTCCTTCAA ACACACAAGT 540 
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GTAATCCAGT TTGTGACTTT TTAAAGTTGA AACCAATCAA TCAATCAAAG AAAGCCCTCC 
TAAGAGGTAC ATTACCCGTC GTACAATT 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Dictyos tel ium discoideum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

lie Val Cys Val Ser lie Glu Lys Thr Pro Phe Ala Lys Gly Ser Cys 
1 5 10 15 

Arg Thr Ala His Lys Leu Lys Asp Trp Ser Gin Pro Asp Gin Gly Leu 
20 25 30 

Val Gly Lys Phe Ser Thr Asn Lys Lys Thr Thr Arg Asp Ser Tyr Phe 
35 40 45 

Thr Asp Val Leu Met Gin Thr Phe Cys Ala Lys Trp Ala Glu Lys Phe 
50 55 60 

Asn Glu Ala Lys Pro Pro Lys Pro lie Thr Phe Leu Pro Ser Tyr Val 
65 70 75 .80 
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Tyr Glu Leu lie Asp His Pro Pro Pro Tyr Pro Val Cys Gly Gly Glu 
85 90 95 

Pro Phe lie Glu Gly Asp Tyr Lys Lys His Asn Asn Asn Ser Gly Tyr 
100 105 110 

Val Ser Ser Asp Ala Arg Asn Thr Pro Gin Ser Phe Ser His Phe Ser 
115 120 125 

Tyr Glu Leu Ser Asn His Glu Leu Leu lie Val Asp lie Gin Gly Val 
130 135 140 

Asn Asp Phe Tyr Thr Asp Pro Gin lie His Thr Lys Ser Gly Glu Gly 
145 150 155 160 

Phe Gly Glu Gly Asn Leu Gly Glu Thr Gly Phe His Lys Phe Leu Gin 
165 170 175 

Thr His Lys Cys Asn Pro Val Cys Asp Phe Leu Lys Leu Lys Pro lie 
180 185 190 

Asn Gin Ser Lys Lys Ala Leu Leu Arg Gly Thr Leu Pro Val Val Gin 
195 200 205 



Leu 



INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gly Glu Trp Leu Asp Asp Glu Val Leu lie Lys Met Ala Ser Gin Pro 
1 5 10 15 

Phe Gly Arg Gly Ala Met Arg Glu Cys Phe Arg Thr Lys Lys Leu Ser 
20 25 30 

Asn Phe Leu His Ala Gin Gin Trp Lys Gly Ala Ser Asn Tyr Val Ala 
35 40 45 

Lys Arg Tyr lie Glu Pro Val Asp Arg Asp Val Tyr Phe Glu Asp Val 
50 55 60 

Arg Leu Gin Met Glu Ala Lys Leu Trp Gly Glu Glu Tyr Asn Arg His 
65 70 75 80 

Lys Pro Pro Lys Gin Val.Asp lie Met Gin Met Cys lie lie Glu Leu 
85 90 95 

Lys Asp Arg Pro Gly Lys Pro Leu Phe His Leu Glu His Tyr lie Glu 
100 105 110 

Gly Lys Tyr lie Lys Tyr Asn Ser Asn Ser Gly Phe Val Arg Asp Asp 
115 120 125 

Asn lie Arg Leu Thr Pro Gin Ala Phe Ser His Phe Thr Phe Glu Arg 
130 135 140 



Ser Gly His Gin Leu lie Val Val Asp lie Gin Gly Val Gly Asp • Leu 
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145 150 155 160 

Tyr Thr Asp Pro Gin lie His Thr Glu Thr Gly Thr Asp Phe Gly Asp 
165 170 175 

5 

Gly Asn Leu Gly Val Arg Gly Met Ala Leu Phe Phe Tyr Ser His Ala 
180 185 190 

Cys Asn Arg lie Cys Glu Ser Met Gly Leu Ala Pro Phe Asp Leu Ser 
10 195 200 205 

D 

yj'i Pro Arg Glu Arg Asp Ala Val Asn Gin Asn Thr Lys Leu Leu Gin Ser 

210 215 220 

. it: 

y,; 

31 = 

15 Ala Lys Thr lie Leu Arg Gly Thr Glu Glu Lys Cys Gly Ser 

^ 225 230 235 

N : (2) INFORMATION FOR SEQ ID NO: 16: 

fU 



20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: D . discoideum 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Asn Lys Trp lie Arg Leu Ser Met Lys Leu Lys Val Glu Arg Lys Pro 
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15 10 15 

Phe Ala Glu Gly Ala Leu Arg Glu Ala Tyr His Thr Val Ser Leu Gly 
20 25 30 

Val Gly Thr Asp Glu Asn Tyr Pro Leu Gly Thr Thr Thr Lys Leu Phe 
35 40 45 

Pro Pro He Glu Met He Ser Pro He Ser Lys Asn Asn Glu Ala Met 
50 55 60 

Thr Gin Leu Lys Asn Gly Thr Lys Phe Val Leu Lys Leu Tyr Lys Lys 
65 70 75 80 

Glu Ala Glu Gin Gin Ala Ser Arg Glu Leu Tyr Phe Glu Asp Val Lys 
85 90 95 

Met Gin Met Val Cys Arg Asp Trp Gly Asn Lys Phe Asn Gin Lys Lys 
100 105 no 

Pro Pro Lys Lys lie Glu Phe Leu Met Ser Trp Val Val Glu Leu lie 
H5 120 125 

Asp Arg Ser Pro Ser Ser Asn Gly Gin Pro He Leu Cys Ser He Glu 
130 -135 140 

Pro Leu Leu Val Gly Glu Phe Lys Lys Asn Asn Ser Asn Tyr Gly Ala 
145 150 155 160 

Val Leu Thr Asn Arg Ser Thr Pro Gin Ala Phe Ser His Phe Thr Tyr 
165 170 175 

Glu Leu Ser Asn Lys Gin Met He Val Val Asp He Gin Gly Val Asp 
180 185 190 



Asp Leu Tyr Thr Asp Pro Gin He His Thr Pro Asp Gly Lys Gly Phe 
195 200 205 
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Gly Leu Gly Asn Leu Gly Lys Ala Gly He Aan*Lys Phe II 



210 



e Thr Thr 



215 



220 



His Lys Cys Asn Ala Val Cys Ala 



225 



Leu Leu Asp Leu Asp Val Lys Leu 



230 



235 



240 



Gly Gly Val Leu Ser Gly Asn Asn Lys Lys Gin Leu Gin Gin Gly Thr 
Met Val 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL : NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: D. discoideum 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Ala Gin Trp Thr Cys Thr Ala Thr Leu Val Lys Val Glu Pro Val Pro 
1 

Phe Ala Glu Gly Ala Phe Arg Lys Ala Tyr His Thr Leu Asp Leu Ser 
20 
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Lys Ser Gly Ala Ser Gly Arg Tyr Val Ser Lys lie Gly Lys Lys Pro 
35 40 45 

Thr Pro Arg Pro Ser Tyr Phe Glu Asp Val Lys Met Gin Met lie Ala 
50 55 60 

Lys Lys Trp Ala Asp Lys Tyr Asn Ser Phe Lys Pro Pro Lys Lys lie 
65 70 75 80 

Glu Phe Leu Gin Ser Cys Val Leu Glu Phe Val Asp Arg Thr Ser Ser 
85 90 95 

Asp Leu lie Cys Gly Ala Glu Pro Tyr Val Glu Gly Gin Tyr Arg Lys 
100 105 110 

Tyr Asn Asn Asn Ser Gly Phe Val Ser Asn Asp Glu Arg Asn Thr Pro 
115 120 125 

Gin Ser Phe Ser His Phe Thr Tyr Glu His Ser Asn His Gin Leu Leu 
130 135 140 

lie lie Asp lie Gin Gly Val Gly Asp His Tyr Thr Asp Pro Gin lie 
145 150 155 160 

His Thr Tyr Asp Gly Val Gly Phe Gly lie Gly Asn Leu Gly Gin Lys 
165 170 175 

Gly Phe Glu Lys Phe Leu Asp Thr His Lys Cys Asn Ala lie Cys Gin 
180 185 190 

Tyr Leu Asn Leu Gin Ser lie Asn Pro Lys Ser Glu Lys Ser Asp Cys 
195 200 205 

Gly Thr Val Pro 
210 



INFORMATION FOR SEQ ID NO: 18 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: C. elegans 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Lys Gin Trp Thr Glu Asp He Val Asp Val Arg Leu His Pro Asp Ser 
15 10 15 

Phe Ala Arg Gly Ala Met Arg Glu Cys Tyr Arg Leu Lys Lys Cys Ser 
20 25 30 

Lys His Gly Thr Ser Gin Asp Trp Ser Ser Asn Tyr Val Ala Lys Arg 
35 40 45 



Tyr He Cys Gin Val Asp Arg Arg Val Leu Phe Asp Asp Val Arg Leu 
50 55 60 



ro 



Gin Met Asp Ala Lys Leu Trp Ala Glu Glu Tyr Asn Arg Tyr Asn P 
65 70 75 so 

Pro Lys Lys lie Asp He Val Gin Met Cys Val He Glu Met He Asp 
85 90 95 

Val Lys Gly Ser Pro Leu Tyr His Leu Glu His Phe He Glu Gly Lys 
100 los 110 
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Tyr He Lys Tyr Asn Ser Asn Ser Gly Phe Val Ser Asn Ala Ala Arg 
115 120 125 

Leu Thr Pro Gin Ala Phe Ser His Phe Thr Phe Glu Arg Ser Gly His 
130 135 140 



Gin Met Met Val Val Asp lie Gin Gly Val Gly Asp Leu Tyr Thr Asp 

160 



145 iso 155 



Pro Gin He His Thr Val Val Gly Thr Asp Tyr Gly Asp Gly Asn Leu 
165 170 175 

Gly Thr Arg Gly Met Ala Leu Phe Phe His Ser His Arg Cys Asn Asp 
180 185 190 

He Cys Glu Thr Met Asp Leu Ser Asn Phe Glu Leu Ser Pro Pro Glu 
i95 200 205 

He Glu Ala Thr Glu Val Ala Met Glu Val Ala Ala Lys Gin Lys Lys 
210 215 220 

Ser Cys lie Val Pro Pro Thr Val Phe 
225 230 

) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotide Primer D" 



(iii) HYPOTHETICAL: NO 



sl:< 




111 



+ 



(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 19; 
5 GGATTTGGAC TGGACAAGAA CCCCC 

(2) INFORMATION FOR SEQ ID NO: 20: 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



m 

U1 (iii) HYPOTHETICAL: NO 

S 

-ru 20 

LU (xi) SEQUENCE DESCRIPTION: SEQ ID 



NO: 20: 



25 



25 



Arg Lys Lys Phe Gly Glu Ser Glu Lys Thr Lys Thr Lys Glu 



Phe Leu 

5 10 15 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

Leu Thr Pro Gin Ala Phe Ser His Phe Thr Phe Glu Arg 
15 10 

10 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 
U'l (D) TOPOLOGY: linear 

y- (ii) MOLECULE TYPE: peptide 



20 



p (iii) HYPOTHETICAL: NO 

Mi 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Leu Ala Asn Xaa Tyr Tyr Glu Lys Ala Glu 
15 10 

30 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 9 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucleotides 



( i i i ) HYPOTHETICAL : 



NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO -23- 

10 

CANGCNTTNN NNCANTTNAC NTTNGANNG 
(2) INFORMATION FOR SEQ ID NO: 24: 

15 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE : nucleic acid 
CC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



20 



25 



30 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "Oligonucleotides" 



(iii) HYPOTHETICAL : 



NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TCNGCNTTNT CNTANTANTT NTTNGC 
(2) INFORMATION FOR SEQ ID NO: 25: 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pair 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Oligonucl 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TACAATCAGC TGATGACCAG AACGCTC 



